switchBox: An R package for k-Top Scoring Pairs (kTSP) classifier development
Authors
Abstract
Summary: k-Top Scoring Pairs (kTSP) is a classification method for prediction from high-throughput data based on a set of paired measurements. Each of the two possible orderings of a pair of measurements (e.g., a reversal in the expression of two genes) is associated with one of two classes. The kTSP prediction rule aggregates the votes of these individual two-feature decision rules, each of which is based on order switching. kTSP, like its predecessor, TSP, is a parameter-free classifier that relies only on the ranking of a small subset of features, which makes it robust to noise and potentially easy to interpret in biological terms. In contrast to TSP, kTSP has accuracy comparable to standard genomics classification techniques, including Support Vector Machines (SVM) and Prediction Analysis for Microarrays (PAM). Here, we describe “switchBox,” an R package for kTSP-based prediction.
Availability: The “switchBox” package is freely available from Bioconductor: http://www.bioconductor.org
Contact: [email protected]
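The voting rule described in the summary can be illustrated with a minimal sketch. This is not the switchBox API (switchBox is an R package); the function name `ktsp_predict`, the gene names, the class labels, and the expression values below are all hypothetical. The sketch assumes the k pairs have already been selected, uses an unweighted majority vote, and counts a tied pair of measurements as a vote for the first class.

```python
import math
from typing import Mapping, Sequence, Tuple

Pair = Tuple[str, str]  # (feature_a, feature_b); feature names are hypothetical


def ktsp_predict(
    sample: Mapping[str, float],
    pairs: Sequence[Pair],
    labels: Tuple[str, str] = ("class0", "class1"),
) -> str:
    """Unweighted majority vote over pairwise order comparisons.

    Each pair (a, b) votes for labels[1] when sample[a] > sample[b],
    and for labels[0] otherwise (assumption: ties go to labels[0]).
    """
    if len(pairs) % 2 == 0:
        # Assumption of this sketch: an odd k rules out tied votes.
        raise ValueError("use an odd number of pairs to avoid tied votes")
    votes_for_1 = sum(1 for a, b in pairs if sample[a] > sample[b])
    return labels[1] if votes_for_1 > len(pairs) / 2 else labels[0]


# Hypothetical pairs and expression values, for illustration only.
pairs = [("g1", "g2"), ("g3", "g4"), ("g5", "g6")]
sample = {"g1": 5.0, "g2": 2.0, "g3": 1.0, "g4": 3.0, "g5": 7.0, "g6": 6.5}

prediction = ktsp_predict(sample, pairs)

# Only the within-sample ordering matters, so a monotone transform
# (here a log) leaves the prediction unchanged.
log_sample = {gene: math.log(value) for gene, value in sample.items()}
log_prediction = ktsp_predict(log_sample, pairs)

print(prediction, log_prediction)
```

Because the decision depends only on which member of each pair is larger within a sample, the rule is unaffected by any monotone rescaling of that sample, which is the source of the robustness mentioned in the summary. How the pairs are scored and selected is not shown here.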
Similar resources
Rgtsp: a generalized top scoring pairs package for class prediction
Summary: A top scoring pair (TSP) classifier consists of a pair of variables whose relative ordering can be used for accurately predicting the class label of a sample. This classification rule has the advantage of being easily interpretable and more robust against technical variations in data, such as those due to different microarray platforms. Here we describe a parallel implementation of this clas...
Bioconductor’s tspair package
The tspair package contains functions for calculating the top scoring pair for classification of high-dimensional data sets [1]. A top scoring pair is a pair of genes whose relative ranks can be used to classify arrays according to a binary phenotype. A top scoring pair classifier has three advantages over standard classifiers: (1) the classifier is based on the relative ranks of genes and is m...
The tspair package for finding top scoring pair classifiers in R
Top scoring pairs (TSPs) are pairs of genes whose relative rankings can be used to accurately classify individuals into one of two classes. TSPs have two main advantages over many standard classifiers used in gene expression studies: (i) a TSP is based on only two genes, which leads to easily interpretable and inexpensive diagnostic tests and (ii) TSP classifiers are based on gene ra...
A Generic Framework for Top-k Pairs and Top-k Objects Queries over Sliding Windows
Top-k pairs and top-k objects queries have received significant attention from the research community. In this paper, we present the first approach to answer a broad class of top-k pairs and top-k objects queries over sliding windows. Our framework handles multiple top-k queries and each query is allowed to use a different scoring function, a different value of k and a different size of the slidi...